Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 193 |
| Missing cells | 1071 |
| Missing cells (%) | 21.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 39.3 KiB |
| Average record size in memory | 208.7 B |
Variable types
| Text | 2 |
|---|---|
| Numeric | 24 |
1st_2nd_lda is highly overall correlated with 1st_2nd_stm and 2 other fields | High correlation |
1st_2nd_lda_adj is highly overall correlated with 1st_2nd_stm_adj and 3 other fields | High correlation |
1st_2nd_stm is highly overall correlated with 1st_2nd_lda and 1 other fields | High correlation |
1st_2nd_stm_adj is highly overall correlated with 1st_2nd_lda_adj and 3 other fields | High correlation |
1st_2nd_tfidf is highly overall correlated with 1st_2nd_lda and 2 other fields | High correlation |
1st_2nd_tfidf_adj is highly overall correlated with 1st_2nd_lda_adj and 4 other fields | High correlation |
1st_2nd_use is highly overall correlated with 1st_2nd_lda and 4 other fields | High correlation |
1st_2nd_use_adj is highly overall correlated with 1st_2nd_lda_adj and 5 other fields | High correlation |
1st_const_year is highly overall correlated with 1st_curr_lda_adj and 7 other fields | High correlation |
1st_curr_lda_adj is highly overall correlated with 1st_const_year and 5 other fields | High correlation |
1st_curr_stm_adj is highly overall correlated with 1st_const_year and 5 other fields | High correlation |
1st_curr_tfidf_adj is highly overall correlated with 1st_curr_lda_adj and 3 other fields | High correlation |
1st_curr_use_adj is highly overall correlated with 1st_curr_lda_adj and 3 other fields | High correlation |
1st_current_lda is highly overall correlated with 1st_current_stm and 3 other fields | High correlation |
1st_current_stm is highly overall correlated with 1st_current_lda and 4 other fields | High correlation |
1st_current_tfidf is highly overall correlated with 1st_2nd_tfidf and 7 other fields | High correlation |
1st_current_use is highly overall correlated with 1st_2nd_use and 2 other fields | High correlation |
2nd_const_year is highly overall correlated with 1st_const_year and 6 other fields | High correlation |
constitutional_time is highly overall correlated with 1st_const_year and 7 other fields | High correlation |
first_regime_time is highly overall correlated with 1st_2nd_lda_adj and 3 other fields | High correlation |
lda_distance is highly overall correlated with 1st_const_year and 8 other fields | High correlation |
stm_distance is highly overall correlated with 1st_const_year and 8 other fields | High correlation |
tfidf_distance is highly overall correlated with 1st_2nd_tfidf and 9 other fields | High correlation |
use_distance is highly overall correlated with 1st_2nd_use and 8 other fields | High correlation |
2nd_const_year has 63 (32.6%) missing values | Missing |
1st_2nd_tfidf has 63 (32.6%) missing values | Missing |
1st_current_tfidf has 63 (32.6%) missing values | Missing |
1st_2nd_lda has 63 (32.6%) missing values | Missing |
1st_current_lda has 63 (32.6%) missing values | Missing |
1st_2nd_use has 63 (32.6%) missing values | Missing |
1st_current_use has 63 (32.6%) missing values | Missing |
1st_2nd_stm has 63 (32.6%) missing values | Missing |
1st_current_stm has 63 (32.6%) missing values | Missing |
1st_2nd_tfidf_adj has 63 (32.6%) missing values | Missing |
1st_2nd_lda_adj has 63 (32.6%) missing values | Missing |
1st_2nd_use_adj has 63 (32.6%) missing values | Missing |
1st_2nd_stm_adj has 63 (32.6%) missing values | Missing |
1st_curr_tfidf_adj has 63 (32.6%) missing values | Missing |
1st_curr_lda_adj has 63 (32.6%) missing values | Missing |
1st_curr_use_adj has 63 (32.6%) missing values | Missing |
1st_curr_stm_adj has 63 (32.6%) missing values | Missing |
code has unique values | Unique |
country has unique values | Unique |
tfidf_distance has 63 (32.6%) zeros | Zeros |
lda_distance has 63 (32.6%) zeros | Zeros |
use_distance has 63 (32.6%) zeros | Zeros |
stm_distance has 63 (32.6%) zeros | Zeros |
Reproduction
| Analysis started | 2024-01-28 18:34:21.079233 |
|---|---|
| Analysis finished | 2024-01-28 18:36:03.522419 |
| Duration | 1 minute and 42.44 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
code
Text
UNIQUE 
| Distinct | 193 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.0051813 |
| Min length | 3 |
Characters and Unicode
| Total characters | 580 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 193 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | AFG |
|---|---|
| 2nd row | ALB |
| 3rd row | DZA |
| 4th row | AND |
| 5th row | AGO |
| Value | Count | Frequency (%) |
| afg | 1 | 0.5% |
| alb | 1 | 0.5% |
| bhs | 1 | 0.5% |
| dza | 1 | 0.5% |
| and | 1 | 0.5% |
| ago | 1 | 0.5% |
| atg | 1 | 0.5% |
| arg | 1 | 0.5% |
| arm | 1 | 0.5% |
| aus | 1 | 0.5% |
| Other values (183) | 183 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 44 | 7.6% |
| R | 43 | 7.4% |
| N | 40 | 6.9% |
| M | 36 | 6.2% |
| L | 34 | 5.9% |
| S | 33 | 5.7% |
| T | 32 | 5.5% |
| G | 29 | 5.0% |
| B | 28 | 4.8% |
| E | 26 | 4.5% |
| Other values (16) | 235 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 580 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 44 | 7.6% |
| R | 43 | 7.4% |
| N | 40 | 6.9% |
| M | 36 | 6.2% |
| L | 34 | 5.9% |
| S | 33 | 5.7% |
| T | 32 | 5.5% |
| G | 29 | 5.0% |
| B | 28 | 4.8% |
| E | 26 | 4.5% |
| Other values (16) | 235 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 580 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 44 | 7.6% |
| R | 43 | 7.4% |
| N | 40 | 6.9% |
| M | 36 | 6.2% |
| L | 34 | 5.9% |
| S | 33 | 5.7% |
| T | 32 | 5.5% |
| G | 29 | 5.0% |
| B | 28 | 4.8% |
| E | 26 | 4.5% |
| Other values (16) | 235 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 580 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 44 | 7.6% |
| R | 43 | 7.4% |
| N | 40 | 6.9% |
| M | 36 | 6.2% |
| L | 34 | 5.9% |
| S | 33 | 5.7% |
| T | 32 | 5.5% |
| G | 29 | 5.0% |
| B | 28 | 4.8% |
| E | 26 | 4.5% |
| Other values (16) | 235 |
country
Text
UNIQUE 
| Distinct | 193 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 21 |
| Mean length | 8.4559585 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1632 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 193 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Afghanistan |
|---|---|
| 2nd row | Albania |
| 3rd row | Algeria |
| 4th row | Andorra |
| 5th row | Angola |
| Value | Count | Frequency (%) |
| republic | 5 | 2.1% |
| and | 4 | 1.7% |
| guinea | 4 | 1.7% |
| saint | 3 | 1.2% |
| south | 3 | 1.2% |
| the | 2 | 0.8% |
| korea | 2 | 0.8% |
| united | 2 | 0.8% |
| sudan | 2 | 0.8% |
| arab | 2 | 0.8% |
| Other values (207) | 211 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 255 | |
| i | 144 | 8.8% |
| n | 125 | 7.7% |
| e | 109 | 6.7% |
| o | 90 | 5.5% |
| r | 89 | 5.5% |
| u | 66 | 4.0% |
| t | 62 | 3.8% |
| l | 60 | 3.7% |
| s | 52 | 3.2% |
| Other values (41) | 580 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1352 | |
| Uppercase Letter | 233 | 14.3% |
| Space Separator | 47 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 255 | |
| i | 144 | |
| n | 125 | |
| e | 109 | 8.1% |
| o | 90 | 6.7% |
| r | 89 | 6.6% |
| u | 66 | 4.9% |
| t | 62 | 4.6% |
| l | 60 | 4.4% |
| s | 52 | 3.8% |
| Other values (16) | 300 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 27 | 11.6% |
| M | 19 | 8.2% |
| B | 18 | 7.7% |
| C | 18 | 7.7% |
| A | 17 | 7.3% |
| T | 15 | 6.4% |
| G | 15 | 6.4% |
| N | 12 | 5.2% |
| L | 12 | 5.2% |
| I | 10 | 4.3% |
| Other values (14) | 70 |
Space Separator
| Value | Count | Frequency (%) |
| 47 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1585 | |
| Common | 47 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 255 | |
| i | 144 | 9.1% |
| n | 125 | 7.9% |
| e | 109 | 6.9% |
| o | 90 | 5.7% |
| r | 89 | 5.6% |
| u | 66 | 4.2% |
| t | 62 | 3.9% |
| l | 60 | 3.8% |
| s | 52 | 3.3% |
| Other values (40) | 533 |
Common
| Value | Count | Frequency (%) |
| 47 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1632 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 255 | |
| i | 144 | 8.8% |
| n | 125 | 7.7% |
| e | 109 | 6.7% |
| o | 90 | 5.5% |
| r | 89 | 5.5% |
| u | 66 | 4.0% |
| t | 62 | 3.8% |
| l | 60 | 3.7% |
| s | 52 | 3.2% |
| Other values (41) | 580 |
1st_const_year
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 99 |
|---|---|
| Distinct (%) | 51.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1935.7927 |
| Minimum | 1789 |
|---|---|
| Maximum | 2011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 1789 |
|---|---|
| 5-th percentile | 1816.4 |
| Q1 | 1919 |
| median | 1960 |
| Q3 | 1975 |
| 95-th percentile | 1995 |
| Maximum | 2011 |
| Range | 222 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 57.592123 |
|---|---|
| Coefficient of variation (CV) | 0.029751182 |
| Kurtosis | 0.10938219 |
| Mean | 1935.7927 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | -1.1392082 |
| Sum | 373608 |
| Variance | 3316.8527 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1960 | 9 | 4.7% |
| 1962 | 8 | 4.1% |
| 1978 | 6 | 3.1% |
| 1947 | 5 | 2.6% |
| 1991 | 5 | 2.6% |
| 1979 | 5 | 2.6% |
| 1964 | 5 | 2.6% |
| 1981 | 5 | 2.6% |
| 1975 | 5 | 2.6% |
| 1961 | 5 | 2.6% |
| Other values (89) | 135 |
| Value | Count | Frequency (%) |
| 1789 | 1 | |
| 1791 | 2 | |
| 1795 | 1 | |
| 1801 | 1 | |
| 1808 | 1 | |
| 1809 | 1 | |
| 1811 | 1 | |
| 1813 | 1 | |
| 1814 | 1 | |
| 1818 | 1 |
| Value | Count | Frequency (%) |
| 2011 | 1 | 0.5% |
| 2008 | 1 | 0.5% |
| 2003 | 1 | 0.5% |
| 2002 | 1 | 0.5% |
| 1998 | 1 | 0.5% |
| 1997 | 1 | 0.5% |
| 1996 | 2 | |
| 1995 | 4 | |
| 1994 | 2 | |
| 1993 | 3 |
2nd_const_year
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 83 |
|---|---|
| Distinct (%) | 63.8% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1950.1308 |
| Minimum | 1793 |
|---|---|
| Maximum | 2011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 1793 |
|---|---|
| 5-th percentile | 1828.9 |
| Q1 | 1931 |
| median | 1969.5 |
| Q3 | 1989.75 |
| 95-th percentile | 2007.55 |
| Maximum | 2011 |
| Range | 218 |
| Interquartile range (IQR) | 58.75 |
Descriptive statistics
| Standard deviation | 53.931106 |
|---|---|
| Coefficient of variation (CV) | 0.027655123 |
| Kurtosis | 0.77941336 |
| Mean | 1950.1308 |
| Median Absolute Deviation (MAD) | 24.5 |
| Skewness | -1.2849272 |
| Sum | 253517 |
| Variance | 2908.5642 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1979 | 4 | 2.1% |
| 1992 | 4 | 2.1% |
| 1962 | 4 | 2.1% |
| 1974 | 3 | 1.6% |
| 1996 | 3 | 1.6% |
| 2005 | 3 | 1.6% |
| 1970 | 3 | 1.6% |
| 2008 | 3 | 1.6% |
| 1978 | 3 | 1.6% |
| 1959 | 2 | 1.0% |
| Other values (73) | 98 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 1793 | 1 | |
| 1805 | 1 | |
| 1812 | 1 | |
| 1815 | 1 | |
| 1823 | 1 | |
| 1826 | 1 | |
| 1828 | 1 | |
| 1830 | 1 | |
| 1831 | 1 | |
| 1836 | 1 |
| Value | Count | Frequency (%) |
| 2011 | 2 | |
| 2010 | 2 | |
| 2008 | 3 | |
| 2007 | 1 | 0.5% |
| 2005 | 3 | |
| 2002 | 2 | |
| 2001 | 2 | |
| 1999 | 1 | 0.5% |
| 1998 | 1 | 0.5% |
| 1996 | 3 |
1st_2nd_tfidf
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.55464505 |
| Minimum | 0.0201733 |
|---|---|
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.0201733 |
|---|---|
| 5-th percentile | 0.067218377 |
| Q1 | 0.35113069 |
| median | 0.57103248 |
| Q3 | 0.8262868 |
| 95-th percentile | 0.96160684 |
| Maximum | 1 |
| Range | 0.9798267 |
| Interquartile range (IQR) | 0.47515612 |
Descriptive statistics
| Standard deviation | 0.28979249 |
|---|---|
| Coefficient of variation (CV) | 0.52248279 |
| Kurtosis | -1.0399434 |
| Mean | 0.55464505 |
| Median Absolute Deviation (MAD) | 0.24142873 |
| Skewness | -0.22781695 |
| Sum | 72.103856 |
| Variance | 0.083979689 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3536304 | 1 | 0.5% |
| 0.39374864 | 1 | 0.5% |
| 0.98284872 | 1 | 0.5% |
| 0.57017142 | 1 | 0.5% |
| 0.89508546 | 1 | 0.5% |
| 0.0300009 | 1 | 0.5% |
| 0.77503286 | 1 | 0.5% |
| 0.14577967 | 1 | 0.5% |
| 0.80957755 | 1 | 0.5% |
| 0.6414186 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0201733 | 1 | |
| 0.0300009 | 1 | |
| 0.03110075 | 1 | |
| 0.04367113 | 1 | |
| 0.0565919 | 1 | |
| 0.0569241 | 1 | |
| 0.06451124 | 1 | |
| 0.0705271 | 1 | |
| 0.0705558 | 1 | |
| 0.08149785 | 1 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 0.9976256217 | 1 | |
| 0.98284872 | 1 | |
| 0.976940108 | 1 | |
| 0.974659728 | 1 | |
| 0.968144733 | 1 | |
| 0.962688424 | 1 | |
| 0.960284915 | 1 | |
| 0.95103436 | 1 | |
| 0.94884168 | 1 |
1st_current_tfidf
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6039566 |
| Minimum | 0.0201733 |
|---|---|
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.0201733 |
|---|---|
| 5-th percentile | 0.076121546 |
| Q1 | 0.37213125 |
| median | 0.64427824 |
| Q3 | 0.87531829 |
| 95-th percentile | 0.98309939 |
| Maximum | 1 |
| Range | 0.9798267 |
| Interquartile range (IQR) | 0.50318705 |
Descriptive statistics
| Standard deviation | 0.29539479 |
|---|---|
| Coefficient of variation (CV) | 0.48909937 |
| Kurtosis | -1.0258001 |
| Mean | 0.6039566 |
| Median Absolute Deviation (MAD) | 0.24769431 |
| Skewness | -0.47133038 |
| Sum | 78.514357 |
| Variance | 0.087258082 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3035168 | 1 | 0.5% |
| 0.4996174 | 1 | 0.5% |
| 0.98284872 | 1 | 0.5% |
| 0.74367234 | 1 | 0.5% |
| 0.73904103 | 1 | 0.5% |
| 0.2395039 | 1 | 0.5% |
| 0.4761298 | 1 | 0.5% |
| 0.1445826 | 1 | 0.5% |
| 0.921665385 | 1 | 0.5% |
| 0.983304482 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0201733 | 1 | |
| 0.03110075 | 1 | |
| 0.0565919 | 1 | |
| 0.0569241 | 1 | |
| 0.0605006 | 1 | |
| 0.06490505 | 1 | |
| 0.07049835 | 1 | |
| 0.08299434 | 1 | |
| 0.08384097 | 1 | |
| 0.08689183 | 1 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 0.9976256217 | 1 | |
| 0.9936541156 | 1 | |
| 0.990213237 | 1 | |
| 0.98426032 | 1 | |
| 0.98410011 | 1 | |
| 0.983304482 | 1 | |
| 0.98284872 | 1 | |
| 0.976940108 | 1 | |
| 0.96477796 | 1 |
1st_2nd_lda
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.44316561 |
| Minimum | 0.00092487985 |
|---|---|
| Maximum | 0.832416 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.00092487985 |
|---|---|
| 5-th percentile | 0.1120277 |
| Q1 | 0.29440284 |
| median | 0.43651447 |
| Q3 | 0.57779393 |
| 95-th percentile | 0.80670039 |
| Maximum | 0.832416 |
| Range | 0.83149112 |
| Interquartile range (IQR) | 0.28339109 |
Descriptive statistics
| Standard deviation | 0.20311936 |
|---|---|
| Coefficient of variation (CV) | 0.45833738 |
| Kurtosis | -0.64232809 |
| Mean | 0.44316561 |
| Median Absolute Deviation (MAD) | 0.1429888 |
| Skewness | 0.047521627 |
| Sum | 57.611529 |
| Variance | 0.041257476 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.4902039477 | 1 | 0.5% |
| 0.4247056517 | 1 | 0.5% |
| 0.7338111011 | 1 | 0.5% |
| 0.1832512237 | 1 | 0.5% |
| 0.5699860148 | 1 | 0.5% |
| 0.05153921136 | 1 | 0.5% |
| 0.1837521032 | 1 | 0.5% |
| 0.1090214883 | 1 | 0.5% |
| 0.6453213329 | 1 | 0.5% |
| 0.6659218507 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0009248798472 | 1 | |
| 0.009009424209 | 1 | |
| 0.02491735492 | 1 | |
| 0.05153921136 | 1 | |
| 0.106568484 | 1 | |
| 0.1090214883 | 1 | |
| 0.1107000823 | 1 | |
| 0.1136503332 | 1 | |
| 0.1224426352 | 1 | |
| 0.1570453903 | 1 |
| Value | Count | Frequency (%) |
| 0.8324159977 | 1 | |
| 0.8322413363 | 1 | |
| 0.831322489 | 1 | |
| 0.8247713524 | 1 | |
| 0.8150825904 | 1 | |
| 0.8112244864 | 1 | |
| 0.810847399 | 1 | |
| 0.8016318293 | 1 | |
| 0.777378254 | 1 | |
| 0.7398105908 | 1 |
1st_current_lda
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.58351659 |
| Minimum | 0.0090094242 |
|---|---|
| Maximum | 0.83242574 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.0090094242 |
|---|---|
| 5-th percentile | 0.22517292 |
| Q1 | 0.48043962 |
| median | 0.61706904 |
| Q3 | 0.73142041 |
| 95-th percentile | 0.81790543 |
| Maximum | 0.83242574 |
| Range | 0.82341631 |
| Interquartile range (IQR) | 0.25098079 |
Descriptive statistics
| Standard deviation | 0.19093433 |
|---|---|
| Coefficient of variation (CV) | 0.3272132 |
| Kurtosis | 0.2771821 |
| Mean | 0.58351659 |
| Median Absolute Deviation (MAD) | 0.11933046 |
| Skewness | -0.87972268 |
| Sum | 75.857157 |
| Variance | 0.03645592 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.6341485178 | 1 | 0.5% |
| 0.5404719296 | 1 | 0.5% |
| 0.7338111011 | 1 | 0.5% |
| 0.6720473046 | 1 | 0.5% |
| 0.7431631803 | 1 | 0.5% |
| 0.4167833364 | 1 | 0.5% |
| 0.2256352177 | 1 | 0.5% |
| 0.04773825738 | 1 | 0.5% |
| 0.8312122451 | 1 | 0.5% |
| 0.8022149626 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.009009424209 | 1 | |
| 0.04773825738 | 1 | |
| 0.106568484 | 1 | |
| 0.1136503332 | 1 | |
| 0.1423339059 | 1 | |
| 0.1482686852 | 1 | |
| 0.2247946763 | 1 | |
| 0.2256352177 | 1 | |
| 0.2632287717 | 1 | |
| 0.2703033666 | 1 |
| Value | Count | Frequency (%) |
| 0.8324257359 | 1 | |
| 0.8324159977 | 1 | |
| 0.8321897687 | 1 | |
| 0.8312122451 | 1 | |
| 0.8246794945 | 1 | |
| 0.8213321706 | 1 | |
| 0.8202150196 | 1 | |
| 0.8150825904 | 1 | |
| 0.8131771201 | 1 | |
| 0.8112244864 | 1 |
1st_2nd_use
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.10565743 |
| Minimum | 0.0053433587 |
|---|---|
| Maximum | 0.42081453 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.0053433587 |
|---|---|
| 5-th percentile | 0.010897003 |
| Q1 | 0.039871121 |
| median | 0.067716425 |
| Q3 | 0.15746744 |
| 95-th percentile | 0.30632343 |
| Maximum | 0.42081453 |
| Range | 0.41547117 |
| Interquartile range (IQR) | 0.11759631 |
Descriptive statistics
| Standard deviation | 0.093243389 |
|---|---|
| Coefficient of variation (CV) | 0.88250666 |
| Kurtosis | 1.0813675 |
| Mean | 0.10565743 |
| Median Absolute Deviation (MAD) | 0.038852176 |
| Skewness | 1.3224261 |
| Sum | 13.735466 |
| Variance | 0.0086943297 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.05512329385 | 1 | 0.5% |
| 0.05099676958 | 1 | 0.5% |
| 0.3285916033 | 1 | 0.5% |
| 0.01088671013 | 1 | 0.5% |
| 0.04217074802 | 1 | 0.5% |
| 0.00715785513 | 1 | 0.5% |
| 0.05273526816 | 1 | 0.5% |
| 0.01315690476 | 1 | 0.5% |
| 0.2756207437 | 1 | 0.5% |
| 0.3081018993 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.005343358713 | 1 | |
| 0.006509150755 | 1 | |
| 0.00693760752 | 1 | |
| 0.007083448061 | 1 | |
| 0.00715785513 | 1 | |
| 0.01004561837 | 1 | |
| 0.01088671013 | 1 | |
| 0.010909584 | 1 | |
| 0.01315690476 | 1 | |
| 0.01458493451 | 1 |
| Value | Count | Frequency (%) |
| 0.4208145276 | 1 | |
| 0.3867030993 | 1 | |
| 0.3642154469 | 1 | |
| 0.3285916033 | 1 | |
| 0.3113730362 | 1 | |
| 0.3081018993 | 1 | |
| 0.3070696313 | 1 | |
| 0.3054114134 | 1 | |
| 0.3009266105 | 1 | |
| 0.3005765842 | 1 |
1st_current_use
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.14357442 |
| Minimum | 0.0047233982 |
|---|---|
| Maximum | 0.52326219 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.0047233982 |
|---|---|
| 5-th percentile | 0.026550543 |
| Q1 | 0.070851784 |
| median | 0.12055374 |
| Q3 | 0.19087579 |
| 95-th percentile | 0.31322431 |
| Maximum | 0.52326219 |
| Range | 0.51853879 |
| Interquartile range (IQR) | 0.12002401 |
Descriptive statistics
| Standard deviation | 0.099832416 |
|---|---|
| Coefficient of variation (CV) | 0.69533566 |
| Kurtosis | 1.864929 |
| Mean | 0.14357442 |
| Median Absolute Deviation (MAD) | 0.05930442 |
| Skewness | 1.2405614 |
| Sum | 18.664675 |
| Variance | 0.0099665112 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.1669640441 | 1 | 0.5% |
| 0.08687377779 | 1 | 0.5% |
| 0.3285916033 | 1 | 0.5% |
| 0.08684611184 | 1 | 0.5% |
| 0.1512387381 | 1 | 0.5% |
| 0.07166519638 | 1 | 0.5% |
| 0.09127517087 | 1 | 0.5% |
| 0.00896903061 | 1 | 0.5% |
| 0.5029965022 | 1 | 0.5% |
| 0.1078774117 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.004723398218 | 1 | |
| 0.005343358713 | 1 | |
| 0.00896903061 | 1 | |
| 0.009546910405 | 1 | |
| 0.010909584 | 1 | |
| 0.01349929613 | 1 | |
| 0.02200860971 | 1 | |
| 0.03210179473 | 1 | |
| 0.03330747769 | 1 | |
| 0.03364362066 | 1 |
| Value | Count | Frequency (%) |
| 0.5232621873 | 1 | |
| 0.5029965022 | 1 | |
| 0.4208145276 | 1 | |
| 0.3642154469 | 1 | |
| 0.3380186079 | 1 | |
| 0.3285916033 | 1 | |
| 0.3143392196 | 1 | |
| 0.3118616537 | 1 | |
| 0.3070696313 | 1 | |
| 0.3057088118 | 1 |
1st_2nd_stm
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.42555344 |
| Minimum | 0.053467835 |
|---|---|
| Maximum | 0.83107211 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.053467835 |
|---|---|
| 5-th percentile | 0.13454242 |
| Q1 | 0.26757654 |
| median | 0.39443323 |
| Q3 | 0.57040201 |
| 95-th percentile | 0.77478921 |
| Maximum | 0.83107211 |
| Range | 0.77760428 |
| Interquartile range (IQR) | 0.30282547 |
Descriptive statistics
| Standard deviation | 0.1973448 |
|---|---|
| Coefficient of variation (CV) | 0.46373683 |
| Kurtosis | -0.71125218 |
| Mean | 0.42555344 |
| Median Absolute Deviation (MAD) | 0.13632913 |
| Skewness | 0.34025448 |
| Sum | 55.321947 |
| Variance | 0.038944971 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.1808094633 | 1 | 0.5% |
| 0.3401846372 | 1 | 0.5% |
| 0.7927069066 | 1 | 0.5% |
| 0.2597674839 | 1 | 0.5% |
| 0.1086374728 | 1 | 0.5% |
| 0.2751306334 | 1 | 0.5% |
| 0.1426708258 | 1 | 0.5% |
| 0.2371090592 | 1 | 0.5% |
| 0.496686597 | 1 | 0.5% |
| 0.7635874396 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.05346783526 | 1 | |
| 0.06301731907 | 1 | |
| 0.06664675472 | 1 | |
| 0.09193017521 | 1 | |
| 0.103935994 | 1 | |
| 0.1086374728 | 1 | |
| 0.1278919017 | 1 | |
| 0.1426708258 | 1 | |
| 0.1674803379 | 1 | |
| 0.1795426932 | 1 |
| Value | Count | Frequency (%) |
| 0.8310721109 | 1 | |
| 0.8302623488 | 1 | |
| 0.8220312078 | 1 | |
| 0.8139319874 | 1 | |
| 0.8129855665 | 1 | |
| 0.7927069066 | 1 | |
| 0.7750710625 | 1 | |
| 0.7744447283 | 1 | |
| 0.7646634819 | 1 | |
| 0.7636754462 | 1 |
1st_current_stm
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.55280506 |
| Minimum | 0.066646755 |
|---|---|
| Maximum | 0.83126568 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.066646755 |
|---|---|
| 5-th percentile | 0.20777049 |
| Q1 | 0.3816559 |
| median | 0.58016529 |
| Q3 | 0.74755949 |
| 95-th percentile | 0.82573001 |
| Maximum | 0.83126568 |
| Range | 0.76461893 |
| Interquartile range (IQR) | 0.36590359 |
Descriptive statistics
| Standard deviation | 0.20597414 |
|---|---|
| Coefficient of variation (CV) | 0.37259814 |
| Kurtosis | -1.0161123 |
| Mean | 0.55280506 |
| Median Absolute Deviation (MAD) | 0.1884196 |
| Skewness | -0.31253957 |
| Sum | 71.864658 |
| Variance | 0.042425345 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3514579609 | 1 | 0.5% |
| 0.6599127299 | 1 | 0.5% |
| 0.7927069066 | 1 | 0.5% |
| 0.7922989702 | 1 | 0.5% |
| 0.8312656847 | 1 | 0.5% |
| 0.5429023615 | 1 | 0.5% |
| 0.2147874609 | 1 | 0.5% |
| 0.1978936538 | 1 | 0.5% |
| 0.7989632322 | 1 | 0.5% |
| 0.8304725389 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.06664675472 | 1 | |
| 0.08709304174 | 1 | |
| 0.103935994 | 1 | |
| 0.1905146018 | 1 | |
| 0.1978936538 | 1 | |
| 0.2014834105 | 1 | |
| 0.2020293374 | 1 | |
| 0.2147874609 | 1 | |
| 0.2274341764 | 1 | |
| 0.2287969862 | 1 |
| Value | Count | Frequency (%) |
| 0.8312656847 | 1 | |
| 0.8310721109 | 1 | |
| 0.8304725389 | 1 | |
| 0.8302623488 | 1 | |
| 0.8294142647 | 1 | |
| 0.8263893582 | 1 | |
| 0.8261795564 | 1 | |
| 0.8251805678 | 1 | |
| 0.8231224917 | 1 | |
| 0.8229771521 | 1 |
constitutional_time
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 99 |
|---|---|
| Distinct (%) | 51.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 86.207254 |
| Minimum | 11 |
|---|---|
| Maximum | 233 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 27 |
| Q1 | 47 |
| median | 62 |
| Q3 | 103 |
| 95-th percentile | 205.6 |
| Maximum | 233 |
| Range | 222 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 57.592123 |
|---|---|
| Coefficient of variation (CV) | 0.66806586 |
| Kurtosis | 0.10938219 |
| Mean | 86.207254 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 1.1392082 |
| Sum | 16638 |
| Variance | 3316.8527 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 62 | 9 | 4.7% |
| 60 | 8 | 4.1% |
| 44 | 6 | 3.1% |
| 75 | 5 | 2.6% |
| 31 | 5 | 2.6% |
| 43 | 5 | 2.6% |
| 58 | 5 | 2.6% |
| 41 | 5 | 2.6% |
| 47 | 5 | 2.6% |
| 61 | 5 | 2.6% |
| Other values (89) | 135 |
| Value | Count | Frequency (%) |
| 11 | 1 | 0.5% |
| 14 | 1 | 0.5% |
| 19 | 1 | 0.5% |
| 20 | 1 | 0.5% |
| 24 | 1 | 0.5% |
| 25 | 1 | 0.5% |
| 26 | 2 | |
| 27 | 4 | |
| 28 | 2 | |
| 29 | 3 |
| Value | Count | Frequency (%) |
| 233 | 1 | |
| 231 | 2 | |
| 227 | 1 | |
| 221 | 1 | |
| 214 | 1 | |
| 213 | 1 | |
| 211 | 1 | |
| 209 | 1 | |
| 208 | 1 | |
| 204 | 1 |
first_regime_time
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 80 |
|---|---|
| Distinct (%) | 41.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.797927 |
| Minimum | 1 |
|---|---|
| Maximum | 233 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 14 |
| median | 27 |
| Q3 | 48 |
| 95-th percentile | 110.2 |
| Maximum | 233 |
| Range | 232 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 39.518793 |
|---|---|
| Coefficient of variation (CV) | 1.045528 |
| Kurtosis | 7.3855154 |
| Mean | 37.797927 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | 2.4693408 |
| Sum | 7295 |
| Variance | 1561.735 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17 | 7 | 3.6% |
| 14 | 7 | 3.6% |
| 16 | 6 | 3.1% |
| 4 | 6 | 3.1% |
| 3 | 5 | 2.6% |
| 19 | 5 | 2.6% |
| 31 | 5 | 2.6% |
| 5 | 5 | 2.6% |
| 15 | 5 | 2.6% |
| 28 | 5 | 2.6% |
| Other values (70) | 137 |
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 2 | 3 | |
| 3 | 5 | |
| 4 | 6 | |
| 5 | 5 | |
| 6 | 4 | |
| 7 | 1 | 0.5% |
| 8 | 3 | |
| 9 | 3 | |
| 10 | 3 |
| Value | Count | Frequency (%) |
| 233 | 1 | |
| 208 | 1 | |
| 201 | 1 | |
| 191 | 1 | |
| 170 | 1 | |
| 165 | 1 | |
| 154 | 1 | |
| 147 | 1 | |
| 139 | 1 | |
| 121 | 1 |
1st_2nd_tfidf_adj
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.064686728 |
| Minimum | 0.000502996 |
|---|---|
| Maximum | 0.90709697 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.000502996 |
|---|---|
| 5-th percentile | 0.0043163806 |
| Q1 | 0.010681518 |
| median | 0.02548487 |
| Q3 | 0.055534077 |
| 95-th percentile | 0.237889 |
| Maximum | 0.90709697 |
| Range | 0.90659398 |
| Interquartile range (IQR) | 0.044852559 |
Descriptive statistics
| Standard deviation | 0.13129391 |
|---|---|
| Coefficient of variation (CV) | 2.0296885 |
| Kurtosis | 26.298727 |
| Mean | 0.064686728 |
| Median Absolute Deviation (MAD) | 0.016638227 |
| Skewness | 4.7766238 |
| Sum | 8.4092747 |
| Variance | 0.01723809 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.0442038 | 1 | 0.5% |
| 0.01093746222 | 1 | 0.5% |
| 0.049142436 | 1 | 0.5% |
| 0.016290612 | 1 | 0.5% |
| 0.03086501586 | 1 | 0.5% |
| 0.0100003 | 1 | 0.5% |
| 0.03229303583 | 1 | 0.5% |
| 0.02429661167 | 1 | 0.5% |
| 0.02611540484 | 1 | 0.5% |
| 0.04276124 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.000502996 | 1 | |
| 0.0008017704545 | 1 | |
| 0.00201733 | 1 | |
| 0.002350504317 | 1 | |
| 0.003794420833 | 1 | |
| 0.004042278571 | 1 | |
| 0.00406967027 | 1 | |
| 0.004617915385 | 1 | |
| 0.00465801125 | 1 | |
| 0.0050721 | 1 |
| Value | Count | Frequency (%) |
| 0.907096975 | 1 | |
| 0.90364835 | 1 | |
| 0.55195415 | 1 | |
| 0.35231498 | 1 | |
| 0.322714911 | 1 | |
| 0.31628056 | 1 | |
| 0.2379957 | 1 | |
| 0.23775859 | 1 | |
| 0.227624476 | 1 | |
| 0.20651088 | 1 |
1st_2nd_lda_adj
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.049513003 |
| Minimum | 3.5572302 × 10-5 |
|---|---|
| Maximum | 0.80163183 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 3.5572302 × 10-5 |
|---|---|
| 5-th percentile | 0.0064854003 |
| Q1 | 0.011824608 |
| median | 0.020550206 |
| Q3 | 0.041485869 |
| 95-th percentile | 0.17573289 |
| Maximum | 0.80163183 |
| Range | 0.80159626 |
| Interquartile range (IQR) | 0.029661262 |
Descriptive statistics
| Standard deviation | 0.10713851 |
|---|---|
| Coefficient of variation (CV) | 2.1638459 |
| Kurtosis | 35.146391 |
| Mean | 0.049513003 |
| Median Absolute Deviation (MAD) | 0.010883239 |
| Skewness | 5.5850728 |
| Sum | 6.4366903 |
| Variance | 0.01147866 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.06127549346 | 1 | 0.5% |
| 0.01179737921 | 1 | 0.5% |
| 0.03669055506 | 1 | 0.5% |
| 0.005235749249 | 1 | 0.5% |
| 0.01965469016 | 1 | 0.5% |
| 0.01717973712 | 1 | 0.5% |
| 0.007656337635 | 1 | 0.5% |
| 0.01817024805 | 1 | 0.5% |
| 0.02081681719 | 1 | 0.5% |
| 0.04439479005 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 3.557230182 × 10-5 | 1 | |
| 0.002620096689 | 1 | |
| 0.003180420336 | 1 | |
| 0.004033466684 | 1 | |
| 0.005235749249 | 1 | |
| 0.006174361731 | 1 | |
| 0.006477954831 | 1 | |
| 0.006494500368 | 1 | |
| 0.00649978057 | 1 | |
| 0.006762087726 | 1 |
| Value | Count | Frequency (%) |
| 0.8016318293 | 1 | |
| 0.777378254 | 1 | |
| 0.4077143686 | 1 | |
| 0.2328861703 | 1 | |
| 0.1969738114 | 1 | |
| 0.187437504 | 1 | |
| 0.179047086 | 1 | |
| 0.1716821955 | 1 | |
| 0.1490337043 | 1 | |
| 0.1458244548 | 1 |
1st_2nd_use_adj
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.015300818 |
| Minimum | 0.00027584786 |
|---|---|
| Maximum | 0.3867031 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.00027584786 |
|---|---|
| 5-th percentile | 0.0007827632 |
| Q1 | 0.001691636 |
| median | 0.0035197699 |
| Q3 | 0.0093014493 |
| 95-th percentile | 0.048383308 |
| Maximum | 0.3867031 |
| Range | 0.38642725 |
| Interquartile range (IQR) | 0.0076098133 |
Descriptive statistics
| Standard deviation | 0.047631354 |
|---|---|
| Coefficient of variation (CV) | 3.1129939 |
| Kurtosis | 40.109223 |
| Mean | 0.015300818 |
| Median Absolute Deviation (MAD) | 0.0024572005 |
| Skewness | 6.0891318 |
| Sum | 1.9891064 |
| Variance | 0.0022687459 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.006890411732 | 1 | 0.5% |
| 0.001416576933 | 1 | 0.5% |
| 0.01642958016 | 1 | 0.5% |
| 0.0003110488609 | 1 | 0.5% |
| 0.001454163725 | 1 | 0.5% |
| 0.00238595171 | 1 | 0.5% |
| 0.00219730284 | 1 | 0.5% |
| 0.00219281746 | 1 | 0.5% |
| 0.008890991733 | 1 | 0.5% |
| 0.02054012662 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0002758478593 | 1 | |
| 0.0003110488609 | 1 | |
| 0.000346880376 | 1 | |
| 0.0007009087638 | 1 | |
| 0.0007175441693 | 1 | |
| 0.0007365845831 | 1 | |
| 0.000779256 | 1 | |
| 0.0007870497845 | 1 | |
| 0.0007882253443 | 1 | |
| 0.00086396236 | 1 |
| Value | Count | Frequency (%) |
| 0.3867030993 | 1 | |
| 0.3005765842 | 1 | |
| 0.2158303006 | 1 | |
| 0.1053613701 | 1 | |
| 0.08750394489 | 1 | |
| 0.0553042328 | 1 | |
| 0.05238983032 | 1 | |
| 0.04348644751 | 1 | |
| 0.03195685445 | 1 | |
| 0.02834948051 | 1 |
1st_2nd_stm_adj
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.053535561 |
| Minimum | 0.0016827882 |
|---|---|
| Maximum | 0.77444473 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.0016827882 |
|---|---|
| 5-th percentile | 0.0051174882 |
| Q1 | 0.010717273 |
| median | 0.020480126 |
| Q3 | 0.044950708 |
| 95-th percentile | 0.19435204 |
| Maximum | 0.77444473 |
| Range | 0.77276194 |
| Interquartile range (IQR) | 0.034233435 |
Descriptive statistics
| Standard deviation | 0.11514934 |
|---|---|
| Coefficient of variation (CV) | 2.1508945 |
| Kurtosis | 26.221818 |
| Mean | 0.053535561 |
| Median Absolute Deviation (MAD) | 0.011627264 |
| Skewness | 4.8958874 |
| Sum | 6.9596229 |
| Variance | 0.013259371 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.02260118292 | 1 | 0.5% |
| 0.009449573254 | 1 | 0.5% |
| 0.03963534533 | 1 | 0.5% |
| 0.007421928113 | 1 | 0.5% |
| 0.003746119751 | 1 | 0.5% |
| 0.09171021114 | 1 | 0.5% |
| 0.005944617741 | 1 | 0.5% |
| 0.03951817653 | 1 | 0.5% |
| 0.01602214829 | 1 | 0.5% |
| 0.05090582931 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.001682788181 | 1 | |
| 0.002165333208 | 1 | |
| 0.002688331847 | 1 | |
| 0.00276490608 | 1 | |
| 0.003657165131 | 1 | |
| 0.003746119751 | 1 | |
| 0.004710118217 | 1 | |
| 0.005615384814 | 1 | |
| 0.005940870585 | 1 | |
| 0.005944617741 | 1 |
| Value | Count | Frequency (%) |
| 0.7744447283 | 1 | |
| 0.7636754462 | 1 | |
| 0.6141453375 | 1 | |
| 0.3477130875 | 1 | |
| 0.2435585183 | 1 | |
| 0.2179150846 | 1 | |
| 0.2083755094 | 1 | |
| 0.1772122481 | 1 | |
| 0.1755816702 | 1 | |
| 0.1558968377 | 1 |
1st_curr_tfidf_adj
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0075894263 |
| Minimum | 0.00038964479 |
|---|---|
| Maximum | 0.023860502 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.00038964479 |
|---|---|
| 5-th percentile | 0.0010232233 |
| Q1 | 0.0038469856 |
| median | 0.0064658677 |
| Q3 | 0.0098641141 |
| 95-th percentile | 0.017966702 |
| Maximum | 0.023860502 |
| Range | 0.023470857 |
| Interquartile range (IQR) | 0.0060171285 |
Descriptive statistics
| Standard deviation | 0.0053160365 |
|---|---|
| Coefficient of variation (CV) | 0.70045302 |
| Kurtosis | 0.64041234 |
| Mean | 0.0075894263 |
| Median Absolute Deviation (MAD) | 0.0030692528 |
| Skewness | 1.019514 |
| Sum | 0.98662542 |
| Variance | 2.8260245 × 10-5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.003065826263 | 1 | 0.5% |
| 0.004234045763 | 1 | 0.5% |
| 0.004329730044 | 1 | 0.5% |
| 0.004534587439 | 1 | 0.5% |
| 0.01192001661 | 1 | 0.5% |
| 0.003862966129 | 1 | 0.5% |
| 0.006434186486 | 1 | 0.5% |
| 0.002190645455 | 1 | 0.5% |
| 0.004409882225 | 1 | 0.5% |
| 0.01311072643 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0003896447887 | 1 | |
| 0.0007049835 | 1 | |
| 0.0007232732558 | 1 | |
| 0.0007530491216 | 1 | |
| 0.0008405541667 | 1 | |
| 0.0009431983333 | 1 | |
| 0.0009764686207 | 1 | |
| 0.001080367857 | 1 | |
| 0.001293729545 | 1 | |
| 0.001497160179 | 1 |
| Value | Count | Frequency (%) |
| 0.02386050154 | 1 | |
| 0.02272727273 | 1 | |
| 0.02267330958 | 1 | |
| 0.02196901183 | 1 | |
| 0.02001269931 | 1 | |
| 0.01912179 | 1 | |
| 0.01804372918 | 1 | |
| 0.01787255681 | 1 | |
| 0.01756787192 | 1 | |
| 0.01600473256 | 1 |
1st_curr_lda_adj
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0074436575 |
| Minimum | 0.00020952149 |
|---|---|
| Maximum | 0.024573635 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.00020952149 |
|---|---|
| 5-th percentile | 0.0025482828 |
| Q1 | 0.0039208766 |
| median | 0.0068954084 |
| Q3 | 0.010364076 |
| 95-th percentile | 0.013662685 |
| Maximum | 0.024573635 |
| Range | 0.024364113 |
| Interquartile range (IQR) | 0.0064431993 |
Descriptive statistics
| Standard deviation | 0.0041019489 |
|---|---|
| Coefficient of variation (CV) | 0.55106631 |
| Kurtosis | 1.5026412 |
| Mean | 0.0074436575 |
| Median Absolute Deviation (MAD) | 0.0031089973 |
| Skewness | 0.91986874 |
| Sum | 0.96767547 |
| Variance | 1.6825985 × 10-5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.006405540584 | 1 | 0.5% |
| 0.00458027059 | 1 | 0.5% |
| 0.003232648023 | 1 | 0.5% |
| 0.004097849419 | 1 | 0.5% |
| 0.01198650291 | 1 | 0.5% |
| 0.006722311877 | 1 | 0.5% |
| 0.003049124563 | 1 | 0.5% |
| 0.00072330693 | 1 | 0.5% |
| 0.003977092082 | 1 | 0.5% |
| 0.0106961995 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0002095214932 | 1 | |
| 0.00072330693 | 1 | |
| 0.001114218953 | 1 | |
| 0.001776141399 | 1 | |
| 0.002319989807 | 1 | |
| 0.002463705894 | 1 | |
| 0.00254167689 | 1 | |
| 0.002556356642 | 1 | |
| 0.002581508016 | 1 | |
| 0.002581733867 | 1 |
| Value | Count | Frequency (%) |
| 0.02457363487 | 1 | |
| 0.0189185454 | 1 | |
| 0.01842834998 | 1 | |
| 0.01616458641 | 1 | |
| 0.01446806131 | 1 | |
| 0.01399008142 | 1 | |
| 0.01374956757 | 1 | |
| 0.01355649497 | 1 | |
| 0.01326802972 | 1 | |
| 0.01287594589 | 1 |
1st_curr_use_adj
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0017634251 |
| Minimum | 8.4346397 × 10-5 |
|---|---|
| Maximum | 0.0082776238 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 8.4346397 × 10-5 |
|---|---|
| 5-th percentile | 0.00022727891 |
| Q1 | 0.00086846631 |
| median | 0.0013117156 |
| Q3 | 0.0021358906 |
| 95-th percentile | 0.0049047103 |
| Maximum | 0.0082776238 |
| Range | 0.0081932774 |
| Interquartile range (IQR) | 0.0012674243 |
Descriptive statistics
| Standard deviation | 0.0015000575 |
|---|---|
| Coefficient of variation (CV) | 0.85064993 |
| Kurtosis | 4.0660565 |
| Mean | 0.0017634251 |
| Median Absolute Deviation (MAD) | 0.00061933365 |
| Skewness | 1.9278975 |
| Sum | 0.22924527 |
| Variance | 2.2501724 × 10-6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.001686505496 | 1 | 0.5% |
| 0.0007362184559 | 1 | 0.5% |
| 0.001447540102 | 1 | 0.5% |
| 0.0005295494624 | 1 | 0.5% |
| 0.002439334485 | 1 | 0.5% |
| 0.001155890264 | 1 | 0.5% |
| 0.001233448255 | 1 | 0.5% |
| 0.0001358944032 | 1 | 0.5% |
| 0.002406681829 | 1 | 0.5% |
| 0.00143836549 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 8.434639675 × 10-5 | 1 | |
| 0.0001242641561 | 1 | |
| 0.0001358944032 | 1 | |
| 0.0001646019035 | 1 | |
| 0.0001818264 | 1 | |
| 0.0002164432958 | 1 | |
| 0.0002228054348 | 1 | |
| 0.000232746485 | 1 | |
| 0.0004024691666 | 1 | |
| 0.0004232424944 | 1 |
| Value | Count | Frequency (%) |
| 0.008277623793 | 1 | |
| 0.007046005358 | 1 | |
| 0.00649811518 | 1 | |
| 0.006098761269 | 1 | |
| 0.005524154085 | 1 | |
| 0.005285793785 | 1 | |
| 0.004930787287 | 1 | |
| 0.004872838339 | 1 | |
| 0.004870150068 | 1 | |
| 0.004724827562 | 1 |
1st_curr_stm_adj
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0069859241 |
| Minimum | 0.00068831784 |
|---|---|
| Maximum | 0.018888003 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0.00068831784 |
|---|---|
| 5-th percentile | 0.0023054443 |
| Q1 | 0.0039356516 |
| median | 0.0066953881 |
| Q3 | 0.0089771201 |
| 95-th percentile | 0.013459805 |
| Maximum | 0.018888003 |
| Range | 0.018199685 |
| Interquartile range (IQR) | 0.0050414685 |
Descriptive statistics
| Standard deviation | 0.0038296462 |
|---|---|
| Coefficient of variation (CV) | 0.54819465 |
| Kurtosis | 0.58270177 |
| Mean | 0.0069859241 |
| Median Absolute Deviation (MAD) | 0.0027434808 |
| Skewness | 0.90019675 |
| Sum | 0.90817013 |
| Variance | 1.466619 × 10-5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.003550080413 | 1 | 0.5% |
| 0.005592480762 | 1 | 0.5% |
| 0.00349210091 | 1 | 0.5% |
| 0.004831091282 | 1 | 0.5% |
| 0.01340751104 | 1 | 0.5% |
| 0.008756489701 | 1 | 0.5% |
| 0.002902533256 | 1 | 0.5% |
| 0.002998388693 | 1 | 0.5% |
| 0.003822790585 | 1 | 0.5% |
| 0.01107296719 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0006883178409 | 1 | |
| 0.0009845635344 | 1 | |
| 0.001549924528 | 1 | |
| 0.001555232888 | 1 | |
| 0.001905146018 | 1 | |
| 0.002196125401 | 1 | |
| 0.002273599935 | 1 | |
| 0.002344365109 | 1 | |
| 0.002626796698 | 1 | |
| 0.002833015242 | 1 |
| Value | Count | Frequency (%) |
| 0.01888800252 | 1 | |
| 0.01886959884 | 1 | |
| 0.01811448267 | 1 | |
| 0.01590918734 | 1 | |
| 0.01572122068 | 1 | |
| 0.01352755029 | 1 | |
| 0.0135025905 | 1 | |
| 0.01340751104 | 1 | |
| 0.01307142027 | 1 | |
| 0.01291238536 | 1 |
tfidf_distance
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 131 |
|---|---|
| Distinct (%) | 67.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0647123 |
| Minimum | 0 |
|---|---|
| Maximum | 8.8055288 |
| Zeros | 63 |
| Zeros (%) | 32.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.61926216 |
| Q3 | 1.4574026 |
| 95-th percentile | 4.3369979 |
| Maximum | 8.8055288 |
| Range | 8.8055288 |
| Interquartile range (IQR) | 1.4574026 |
Descriptive statistics
| Standard deviation | 1.4326588 |
|---|---|
| Coefficient of variation (CV) | 1.345583 |
| Kurtosis | 5.2709358 |
| Mean | 1.0647123 |
| Median Absolute Deviation (MAD) | 0.61926216 |
| Skewness | 2.0547368 |
| Sum | 205.48947 |
| Variance | 2.0525111 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 2.55466716 | 1 | 0.5% |
| 1.346237004 | 1 | 0.5% |
| 0.9828487206 | 1 | 0.5% |
| 3.680461466 | 1 | 0.5% |
| 2.866323628 | 1 | 0.5% |
| 0.3675777912 | 1 | 0.5% |
| 1.50062342 | 1 | 0.5% |
| 0.3533120155 | 1 | 0.5% |
| 2.627345592 | 1 | 0.5% |
| Other values (121) | 121 |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 0.02017331123 | 1 | 0.5% |
| 0.03110074997 | 1 | 0.5% |
| 0.05659192801 | 1 | 0.5% |
| 0.05692410469 | 1 | 0.5% |
| 0.08299434185 | 1 | 0.5% |
| 0.08736741543 | 1 | 0.5% |
| 0.1200658083 | 1 | 0.5% |
| 0.1280924678 | 1 | 0.5% |
| 0.1383602619 | 1 | 0.5% |
| Value | Count | Frequency (%) |
| 8.805528793 | 1 | |
| 5.926186346 | 1 | |
| 5.665635705 | 1 | |
| 5.363589868 | 1 | |
| 4.714640707 | 1 | |
| 4.614361033 | 1 | |
| 4.563309059 | 1 | |
| 4.48809563 | 1 | |
| 4.398849934 | 1 | |
| 4.370397478 | 1 |
lda_distance
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 131 |
|---|---|
| Distinct (%) | 67.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.82431813 |
| Minimum | 0 |
|---|---|
| Maximum | 5.3102013 |
| Zeros | 63 |
| Zeros (%) | 32.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.57985105 |
| Q3 | 1.1500078 |
| 95-th percentile | 2.6878081 |
| Maximum | 5.3102013 |
| Range | 5.3102013 |
| Interquartile range (IQR) | 1.1500078 |
Descriptive statistics
| Standard deviation | 0.96643452 |
|---|---|
| Coefficient of variation (CV) | 1.1724048 |
| Kurtosis | 3.2567221 |
| Mean | 0.82431813 |
| Median Absolute Deviation (MAD) | 0.57985105 |
| Skewness | 1.6753087 |
| Sum | 159.0934 |
| Variance | 0.93399569 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 2.182414969 | 1 | 0.5% |
| 1.135737341 | 1 | 0.5% |
| 0.7338111011 | 1 | 0.5% |
| 2.598592762 | 1 | 0.5% |
| 1.352163196 | 1 | 0.5% |
| 0.5455480628 | 1 | 0.5% |
| 0.3011339598 | 1 | 0.5% |
| 0.2367953335 | 1 | 0.5% |
| 2.341969053 | 1 | 0.5% |
| Other values (121) | 121 |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 0.009009424209 | 1 | 0.5% |
| 0.106568484 | 1 | 0.5% |
| 0.1136503332 | 1 | 0.5% |
| 0.23170049 | 1 | 0.5% |
| 0.2367953335 | 1 | 0.5% |
| 0.2703033666 | 1 | 0.5% |
| 0.2703666181 | 1 | 0.5% |
| 0.2783459033 | 1 | 0.5% |
| 0.2847315518 | 1 | 0.5% |
| Value | Count | Frequency (%) |
| 5.310201276 | 1 | |
| 4.299676602 | 1 | |
| 3.981688847 | 1 | |
| 3.608418739 | 1 | |
| 3.518476125 | 1 | |
| 3.422846227 | 1 | |
| 3.152002994 | 1 | |
| 3.026661364 | 1 | |
| 3.014511213 | 1 | |
| 2.813127573 | 1 |
use_distance
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 131 |
|---|---|
| Distinct (%) | 67.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.17740448 |
| Minimum | 0 |
|---|---|
| Maximum | 1.8396432 |
| Zeros | 63 |
| Zeros (%) | 32.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.084286977 |
| Q3 | 0.24213743 |
| 95-th percentile | 0.73340388 |
| Maximum | 1.8396432 |
| Range | 1.8396432 |
| Interquartile range (IQR) | 0.24213743 |
Descriptive statistics
| Standard deviation | 0.25137565 |
|---|---|
| Coefficient of variation (CV) | 1.4169634 |
| Kurtosis | 10.673291 |
| Mean | 0.17740448 |
| Median Absolute Deviation (MAD) | 0.084286977 |
| Skewness | 2.667429 |
| Sum | 34.239064 |
| Variance | 0.063189715 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 0.2721637196 | 1 | 0.5% |
| 0.09802493714 | 1 | 0.5% |
| 0.3285916033 | 1 | 0.5% |
| 0.2856437567 | 1 | 0.5% |
| 0.122433367 | 1 | 0.5% |
| 0.07973911799 | 1 | 0.5% |
| 0.08428697711 | 1 | 0.5% |
| 0.02295066144 | 1 | 0.5% |
| 0.5079892852 | 1 | 0.5% |
| Other values (121) | 121 |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 0.005343358713 | 1 | 0.5% |
| 0.010909584 | 1 | 0.5% |
| 0.02295066144 | 1 | 0.5% |
| 0.02483414906 | 1 | 0.5% |
| 0.02570967874 | 1 | 0.5% |
| 0.03210179473 | 1 | 0.5% |
| 0.03364362066 | 1 | 0.5% |
| 0.03753879925 | 1 | 0.5% |
| 0.03899271319 | 1 | 0.5% |
| Value | Count | Frequency (%) |
| 1.839643225 | 1 | |
| 0.9981496239 | 1 | |
| 0.9761781719 | 1 | |
| 0.9259964936 | 1 | |
| 0.8976717033 | 1 | |
| 0.8100906049 | 1 | |
| 0.8050738149 | 1 | |
| 0.7783798098 | 1 | |
| 0.7424658286 | 1 | |
| 0.7404241889 | 1 |
stm_distance
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 131 |
|---|---|
| Distinct (%) | 67.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1313376 |
| Minimum | 0 |
|---|---|
| Maximum | 13.761275 |
| Zeros | 63 |
| Zeros (%) | 32.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.53334203 |
| Q3 | 1.4531487 |
| 95-th percentile | 4.3256491 |
| Maximum | 13.761275 |
| Range | 13.761275 |
| Interquartile range (IQR) | 1.4531487 |
Descriptive statistics
| Standard deviation | 1.7397494 |
|---|---|
| Coefficient of variation (CV) | 1.537781 |
| Kurtosis | 15.777215 |
| Mean | 1.1313376 |
| Median Absolute Deviation (MAD) | 0.53334203 |
| Skewness | 3.2375335 |
| Sum | 218.34815 |
| Variance | 3.0267281 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 1.852420044 | 1 | 0.5% |
| 1.559017085 | 1 | 0.5% |
| 0.7927069066 | 1 | 0.5% |
| 4.211583706 | 1 | 0.5% |
| 2.577173316 | 1 | 0.5% |
| 1.420003692 | 1 | 0.5% |
| 0.3574582867 | 1 | 0.5% |
| 0.4350027129 | 1 | 0.5% |
| 3.361402 | 1 | 0.5% |
| Other values (121) | 121 |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 0.06664675472 | 1 | 0.5% |
| 0.103935994 | 1 | 0.5% |
| 0.2493890969 | 1 | 0.5% |
| 0.2554971727 | 1 | 0.5% |
| 0.2564407247 | 1 | 0.5% |
| 0.2853324978 | 1 | 0.5% |
| 0.2884652441 | 1 | 0.5% |
| 0.3030384246 | 1 | 0.5% |
| 0.324062172 | 1 | 0.5% |
| Value | Count | Frequency (%) |
| 13.761275 | 1 | |
| 7.655179479 | 1 | |
| 7.011777933 | 1 | |
| 6.548253206 | 1 | |
| 6.469361033 | 1 | |
| 5.623032088 | 1 | |
| 5.45806128 | 1 | |
| 5.290515012 | 1 | |
| 4.525869525 | 1 | |
| 4.496747288 | 1 |
| 1st_2nd_lda | 1st_2nd_lda_adj | 1st_2nd_stm | 1st_2nd_stm_adj | 1st_2nd_tfidf | 1st_2nd_tfidf_adj | 1st_2nd_use | 1st_2nd_use_adj | 1st_const_year | 1st_curr_lda_adj | 1st_curr_stm_adj | 1st_curr_tfidf_adj | 1st_curr_use_adj | 1st_current_lda | 1st_current_stm | 1st_current_tfidf | 1st_current_use | 2nd_const_year | constitutional_time | first_regime_time | lda_distance | stm_distance | tfidf_distance | use_distance | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1st_2nd_lda | 1.000 | 0.121 | 0.717 | -0.085 | 0.538 | -0.041 | 0.593 | 0.088 | -0.128 | 0.276 | 0.176 | 0.288 | 0.326 | 0.467 | 0.312 | 0.360 | 0.386 | 0.198 | 0.128 | 0.427 | 0.219 | 0.091 | 0.096 | 0.265 |
| 1st_2nd_lda_adj | 0.121 | 1.000 | 0.177 | 0.841 | 0.115 | 0.732 | 0.150 | 0.730 | 0.224 | 0.345 | 0.343 | 0.283 | 0.287 | 0.135 | 0.170 | 0.226 | 0.084 | -0.128 | -0.224 | -0.772 | 0.332 | 0.299 | 0.282 | 0.293 |
| 1st_2nd_stm | 0.717 | 0.177 | 1.000 | 0.251 | 0.464 | 0.064 | 0.529 | 0.223 | -0.118 | 0.199 | 0.287 | 0.213 | 0.212 | 0.375 | 0.496 | 0.366 | 0.278 | 0.060 | 0.118 | 0.219 | 0.232 | 0.296 | 0.107 | 0.299 |
| 1st_2nd_stm_adj | -0.085 | 0.841 | 0.251 | 1.000 | 0.029 | 0.702 | 0.065 | 0.747 | 0.234 | 0.236 | 0.347 | 0.196 | 0.180 | 0.034 | 0.198 | 0.191 | -0.016 | -0.174 | -0.234 | -0.852 | 0.244 | 0.327 | 0.192 | 0.240 |
| 1st_2nd_tfidf | 0.538 | 0.115 | 0.464 | 0.029 | 1.000 | 0.450 | 0.420 | 0.155 | -0.345 | -0.016 | -0.062 | 0.393 | 0.208 | 0.407 | 0.382 | 0.679 | 0.392 | -0.227 | 0.345 | 0.186 | 0.454 | 0.377 | 0.609 | 0.461 |
| 1st_2nd_tfidf_adj | -0.041 | 0.732 | 0.064 | 0.702 | 0.450 | 1.000 | 0.053 | 0.598 | -0.018 | 0.109 | 0.114 | 0.405 | 0.170 | 0.208 | 0.265 | 0.475 | 0.135 | -0.356 | 0.018 | -0.703 | 0.469 | 0.441 | 0.601 | 0.398 |
| 1st_2nd_use | 0.593 | 0.150 | 0.529 | 0.065 | 0.420 | 0.053 | 1.000 | 0.579 | -0.176 | 0.055 | 0.001 | 0.207 | 0.456 | 0.241 | 0.191 | 0.389 | 0.601 | 0.021 | 0.176 | 0.187 | 0.159 | 0.094 | 0.124 | 0.579 |
| 1st_2nd_use_adj | 0.088 | 0.730 | 0.223 | 0.747 | 0.155 | 0.598 | 0.579 | 1.000 | 0.141 | 0.204 | 0.220 | 0.291 | 0.450 | 0.077 | 0.135 | 0.319 | 0.345 | -0.131 | -0.141 | -0.636 | 0.241 | 0.226 | 0.208 | 0.528 |
| 1st_const_year | -0.128 | 0.224 | -0.118 | 0.234 | -0.345 | -0.018 | -0.176 | 0.141 | 1.000 | 0.701 | 0.665 | 0.395 | 0.305 | -0.362 | -0.389 | -0.327 | -0.380 | 0.801 | -1.000 | -0.144 | -0.573 | -0.554 | -0.562 | -0.527 |
| 1st_curr_lda_adj | 0.276 | 0.345 | 0.199 | 0.236 | -0.016 | 0.109 | 0.055 | 0.204 | 0.701 | 1.000 | 0.897 | 0.670 | 0.583 | 0.283 | 0.140 | 0.114 | 0.019 | 0.650 | -0.701 | -0.122 | -0.003 | -0.016 | -0.110 | -0.034 |
| 1st_curr_stm_adj | 0.176 | 0.343 | 0.287 | 0.347 | -0.062 | 0.114 | 0.001 | 0.220 | 0.665 | 0.897 | 1.000 | 0.589 | 0.550 | 0.221 | 0.327 | 0.094 | 0.011 | 0.565 | -0.665 | -0.203 | -0.005 | 0.102 | -0.097 | -0.064 |
| 1st_curr_tfidf_adj | 0.288 | 0.283 | 0.213 | 0.196 | 0.393 | 0.405 | 0.207 | 0.291 | 0.395 | 0.670 | 0.589 | 1.000 | 0.577 | 0.266 | 0.191 | 0.616 | 0.191 | 0.371 | -0.395 | -0.075 | 0.155 | 0.111 | 0.254 | 0.182 |
| 1st_curr_use_adj | 0.326 | 0.287 | 0.212 | 0.180 | 0.208 | 0.170 | 0.456 | 0.450 | 0.305 | 0.583 | 0.550 | 0.577 | 1.000 | 0.244 | 0.218 | 0.304 | 0.705 | 0.309 | -0.305 | -0.074 | 0.172 | 0.151 | 0.171 | 0.430 |
| 1st_current_lda | 0.467 | 0.135 | 0.375 | 0.034 | 0.407 | 0.208 | 0.241 | 0.077 | -0.362 | 0.283 | 0.221 | 0.266 | 0.244 | 1.000 | 0.775 | 0.596 | 0.466 | -0.234 | 0.362 | 0.145 | 0.644 | 0.569 | 0.472 | 0.441 |
| 1st_current_stm | 0.312 | 0.170 | 0.496 | 0.198 | 0.382 | 0.265 | 0.191 | 0.135 | -0.389 | 0.140 | 0.327 | 0.191 | 0.218 | 0.775 | 1.000 | 0.572 | 0.470 | -0.354 | 0.389 | 0.022 | 0.650 | 0.759 | 0.543 | 0.450 |
| 1st_current_tfidf | 0.360 | 0.226 | 0.366 | 0.191 | 0.679 | 0.475 | 0.389 | 0.319 | -0.327 | 0.114 | 0.094 | 0.616 | 0.304 | 0.596 | 0.572 | 1.000 | 0.462 | -0.300 | 0.327 | 0.006 | 0.548 | 0.505 | 0.607 | 0.515 |
| 1st_current_use | 0.386 | 0.084 | 0.278 | -0.016 | 0.392 | 0.135 | 0.601 | 0.345 | -0.380 | 0.019 | 0.011 | 0.191 | 0.705 | 0.466 | 0.470 | 0.462 | 1.000 | -0.251 | 0.380 | 0.136 | 0.490 | 0.454 | 0.458 | 0.730 |
| 2nd_const_year | 0.198 | -0.128 | 0.060 | -0.174 | -0.227 | -0.356 | 0.021 | -0.131 | 0.801 | 0.650 | 0.565 | 0.371 | 0.309 | -0.234 | -0.354 | -0.300 | -0.251 | 1.000 | -0.801 | 0.213 | -0.580 | -0.578 | -0.593 | -0.429 |
| constitutional_time | 0.128 | -0.224 | 0.118 | -0.234 | 0.345 | 0.018 | 0.176 | -0.141 | -1.000 | -0.701 | -0.665 | -0.395 | -0.305 | 0.362 | 0.389 | 0.327 | 0.380 | -0.801 | 1.000 | 0.144 | 0.573 | 0.554 | 0.562 | 0.527 |
| first_regime_time | 0.427 | -0.772 | 0.219 | -0.852 | 0.186 | -0.703 | 0.187 | -0.636 | -0.144 | -0.122 | -0.203 | -0.075 | -0.074 | 0.145 | 0.022 | 0.006 | 0.136 | 0.213 | 0.144 | 1.000 | -0.471 | -0.493 | -0.485 | -0.452 |
| lda_distance | 0.219 | 0.332 | 0.232 | 0.244 | 0.454 | 0.469 | 0.159 | 0.241 | -0.573 | -0.003 | -0.005 | 0.155 | 0.172 | 0.644 | 0.650 | 0.548 | 0.490 | -0.580 | 0.573 | -0.471 | 1.000 | 0.973 | 0.962 | 0.923 |
| stm_distance | 0.091 | 0.299 | 0.296 | 0.327 | 0.377 | 0.441 | 0.094 | 0.226 | -0.554 | -0.016 | 0.102 | 0.111 | 0.151 | 0.569 | 0.759 | 0.505 | 0.454 | -0.578 | 0.554 | -0.493 | 0.973 | 1.000 | 0.949 | 0.898 |
| tfidf_distance | 0.096 | 0.282 | 0.107 | 0.192 | 0.609 | 0.601 | 0.124 | 0.208 | -0.562 | -0.110 | -0.097 | 0.254 | 0.171 | 0.472 | 0.543 | 0.607 | 0.458 | -0.593 | 0.562 | -0.485 | 0.962 | 0.949 | 1.000 | 0.905 |
| use_distance | 0.265 | 0.293 | 0.299 | 0.240 | 0.461 | 0.398 | 0.579 | 0.528 | -0.527 | -0.034 | -0.064 | 0.182 | 0.430 | 0.441 | 0.450 | 0.515 | 0.730 | -0.429 | 0.527 | -0.452 | 0.923 | 0.898 | 0.905 | 1.000 |
| code | country | 1st_const_year | 2nd_const_year | 1st_2nd_tfidf | 1st_current_tfidf | 1st_2nd_lda | 1st_current_lda | 1st_2nd_use | 1st_current_use | 1st_2nd_stm | 1st_current_stm | constitutional_time | first_regime_time | 1st_2nd_tfidf_adj | 1st_2nd_lda_adj | 1st_2nd_use_adj | 1st_2nd_stm_adj | 1st_curr_tfidf_adj | 1st_curr_lda_adj | 1st_curr_use_adj | 1st_curr_stm_adj | tfidf_distance | lda_distance | use_distance | stm_distance | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | AFG | Afghanistan | 1923 | 1931.0 | 0.353630 | 0.303517 | 0.490204 | 0.634149 | 0.055123 | 0.166964 | 0.180809 | 0.351458 | 99 | 8.0 | 0.044204 | 0.061275 | 0.006890 | 0.022601 | 0.003066 | 0.006406 | 0.001687 | 0.003550 | 2.554667 | 2.182415 | 0.272164 | 1.852420 |
| 1 | ALB | Albania | 1925 | 1928.0 | 0.513447 | 0.636341 | 0.437473 | 0.788622 | 0.082878 | 0.102103 | 0.526745 | 0.751644 | 97 | 3.0 | 0.171149 | 0.145824 | 0.027626 | 0.175582 | 0.006560 | 0.008130 | 0.001053 | 0.007749 | 2.818089 | 2.412450 | 0.778380 | 3.296292 |
| 2 | DZA | Algeria | 1963 | 1996.0 | 0.650947 | 0.650947 | 0.536840 | 0.536840 | 0.132969 | 0.132969 | 0.483908 | 0.483908 | 59 | 33.0 | 0.019726 | 0.016268 | 0.004029 | 0.014664 | 0.011033 | 0.009099 | 0.002254 | 0.008202 | 0.650947 | 0.536840 | 0.132969 | 0.483908 |
| 3 | AND | Andorra | 1993 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 29 | 29.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.000000 | 0.000000 | 0.000000 | 0.000000 |
| 4 | AGO | Angola | 1975 | 2010.0 | 0.317443 | 0.317443 | 0.571623 | 0.571623 | 0.305411 | 0.305411 | 0.489318 | 0.489318 | 47 | 35.0 | 0.009070 | 0.016332 | 0.008726 | 0.013981 | 0.006754 | 0.012162 | 0.006498 | 0.010411 | 0.317443 | 0.571623 | 0.305411 | 0.489318 |
| 5 | ATG | Antigua Barbuda | 1981 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 41 | 41.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.000000 | 0.000000 | 0.000000 | 0.000000 |
| 6 | ARG | Argentina | 1819 | 1826.0 | 0.505489 | 0.794305 | 0.251727 | 0.733257 | 0.014585 | 0.043938 | 0.252834 | 0.461541 | 203 | 7.0 | 0.072213 | 0.035961 | 0.002084 | 0.036119 | 0.003913 | 0.003612 | 0.000216 | 0.002274 | 1.149967 | 0.881207 | 0.037539 | 0.714374 |
| 7 | ARM | Armenia | 1995 | 2005.0 | 0.064511 | 0.371961 | 0.122443 | 0.377732 | 0.034281 | 0.096356 | 0.180414 | 0.352928 | 27 | 10.0 | 0.006451 | 0.012244 | 0.003428 | 0.018041 | 0.013776 | 0.013990 | 0.003569 | 0.013071 | 0.389074 | 0.419085 | 0.084017 | 0.533342 |
| 8 | AUS | Australia | 1901 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 121 | 121.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.000000 | 0.000000 | 0.000000 | 0.000000 |
| 9 | AUT | Austria | 1920 | 1934.0 | 0.338217 | 0.338217 | 0.113650 | 0.113650 | 0.048903 | 0.048903 | 0.403290 | 0.403290 | 102 | 14.0 | 0.024158 | 0.008118 | 0.003493 | 0.028806 | 0.003316 | 0.001114 | 0.000479 | 0.003954 | 0.338217 | 0.113650 | 0.048903 | 0.403290 |
| code | country | 1st_const_year | 2nd_const_year | 1st_2nd_tfidf | 1st_current_tfidf | 1st_2nd_lda | 1st_current_lda | 1st_2nd_use | 1st_current_use | 1st_2nd_stm | 1st_current_stm | constitutional_time | first_regime_time | 1st_2nd_tfidf_adj | 1st_2nd_lda_adj | 1st_2nd_use_adj | 1st_2nd_stm_adj | 1st_curr_tfidf_adj | 1st_curr_lda_adj | 1st_curr_use_adj | 1st_curr_stm_adj | tfidf_distance | lda_distance | use_distance | stm_distance | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 183 | USA | United states | 1789 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 233 | 233.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.000000 | 0.000000 | 0.000000 | 0.000000 |
| 184 | URY | Uruguay | 1830 | 1918.0 | 0.070556 | 0.852828 | 0.230569 | 0.661282 | 0.024275 | 0.201772 | 0.236573 | 0.582161 | 192 | 88.0 | 0.000802 | 0.002620 | 0.000276 | 0.002688 | 0.004442 | 0.003444 | 0.001051 | 0.003032 | 1.279092 | 0.927896 | 0.171229 | 1.410233 |
| 185 | UZB | Uzbekistan | 1992 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 30 | 30.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.000000 | 0.000000 | 0.000000 | 0.000000 |
| 186 | VUT | Vanuatu | 1979 | 1980.0 | 0.031101 | 0.031101 | 0.009009 | 0.009009 | 0.005343 | 0.005343 | 0.066647 | 0.066647 | 43 | 1.0 | 0.031101 | 0.009009 | 0.005343 | 0.066647 | 0.000723 | 0.000210 | 0.000124 | 0.001550 | 0.031101 | 0.009009 | 0.005343 | 0.066647 |
| 187 | VEN | Venezuela | 1830 | 1858.0 | 0.495311 | 0.690389 | 0.181846 | 0.760860 | 0.047112 | 0.185052 | 0.184205 | 0.780078 | 192 | 28.0 | 0.017690 | 0.006495 | 0.001683 | 0.006579 | 0.003596 | 0.003963 | 0.000964 | 0.004063 | 5.665636 | 3.608419 | 0.503704 | 13.761275 |
| 188 | VDR | Vietnam | 1960 | 1980.0 | 0.885566 | 0.891462 | 0.240032 | 0.263229 | 0.052652 | 0.056959 | 0.253492 | 0.228797 | 62 | 20.0 | 0.044278 | 0.012002 | 0.002633 | 0.012675 | 0.014378 | 0.004246 | 0.000919 | 0.003690 | 1.157357 | 0.482546 | 0.071202 | 0.482289 |
| 189 | YEM | Yemen Arab Republic | 1970 | 1991.0 | 0.913529 | 0.913529 | 0.647501 | 0.647501 | 0.065579 | 0.065579 | 0.663185 | 0.663185 | 52 | 21.0 | 0.043501 | 0.030833 | 0.003123 | 0.031580 | 0.017568 | 0.012452 | 0.001261 | 0.012754 | 0.913529 | 0.647501 | 0.065579 | 0.663185 |
| 190 | YUG | Yugoslavia | 1921 | 1931.0 | 0.736563 | 0.929297 | 0.484082 | 0.731703 | 0.035465 | 0.133398 | 0.453762 | 0.687188 | 101 | 10.0 | 0.073656 | 0.048408 | 0.003546 | 0.045376 | 0.009201 | 0.007245 | 0.001321 | 0.006804 | 4.488096 | 3.981689 | 0.742466 | 5.623032 |
| 191 | ZMB | Zambia | 1964 | 1973.0 | 0.475826 | 0.086892 | 0.110700 | 0.148269 | 0.007083 | 0.009547 | 0.053468 | 0.202029 | 58 | 9.0 | 0.052870 | 0.012300 | 0.000787 | 0.005941 | 0.001498 | 0.002556 | 0.000165 | 0.003483 | 0.963551 | 0.231700 | 0.024834 | 0.255497 |
| 192 | ZWE | Zimbabwe | 1965 | 1969.0 | 0.249281 | 0.912270 | 0.370943 | 0.824679 | 0.032659 | 0.092031 | 0.329574 | 0.688878 | 57 | 4.0 | 0.062320 | 0.092736 | 0.008165 | 0.082393 | 0.016005 | 0.014468 | 0.001615 | 0.012086 | 1.118774 | 1.431619 | 0.132359 | 1.478264 |